Update tokenizer apply_chat_template with return_dict=True default by albertvillanova · Pull Request #4448 · huggingface/trl

albertvillanova · 2025-11-04T07:52:21Z

Pass explicitly return_dict=True to apply_chat_template and get its input_ids key.

This PR fixes the issue:

RuntimeError: Could not infer dtype of dict

Note that transformers has recently set return_dict=True as the default value:

[v5] Return a BatchEncoding dict from apply_chat_template by default transformers#41626

This PR updates the tokenization logic in the tokenize_fn function of trl/trainer/reward_trainer.py to improve compatibility with the default output format of apply_chat_template. Instead of assuming return_dict=False and directly returning the result, it now requests a dictionary and extracts only the input_ids field.

Tokenization logic update:

The apply_chat_template method is now called with return_dict=True, and only the input_ids from the returned dictionary are used for both the chosen and rejected examples.

HuggingFaceDocBuilderDev · 2025-11-04T07:55:03Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec

I was expecting more, cool!

commit 7a9592b Author: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com> Date: Tue Nov 4 14:32:04 2025 -0700 🐍 Drop Python 3.9 (huggingface#4183) commit 7f15a7f Author: Harras Mansoor <98635627+Harras3@users.noreply.github.com> Date: Wed Nov 5 02:06:31 2025 +0500 Removed outdated warning about batch contamination (huggingface#4423) commit 8b0a3ce Author: Albert Villanova del Moral <8515462+albertvillanova@users.noreply.github.com> Date: Tue Nov 4 21:37:39 2025 +0100 Update tokenizer apply_chat_template with return_dict=True default (huggingface#4448) commit d9f9e2b Author: Pramodith Ballapuram <16939722+pramodith@users.noreply.github.com> Date: Tue Nov 4 19:56:58 2025 +0000 Support casting to fp32 when word embeddings are tied to lm_head (huggingface#4446) commit 4e138ab Author: Sergio Paniego Blanco <sergiopaniegoblanco@gmail.com> Date: Tue Nov 4 15:15:23 2025 +0100 Upload notebook with T4 selected (huggingface#4449)

commit 4677cf2 Author: Harras Mansoor <98635627+Harras3@users.noreply.github.com> Date: Wed Nov 5 04:06:13 2025 +0500 Removed Sentiment Tuning Examples (#4424) commit 7a9592b Author: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com> Date: Tue Nov 4 14:32:04 2025 -0700 🐍 Drop Python 3.9 (#4183) commit 7f15a7f Author: Harras Mansoor <98635627+Harras3@users.noreply.github.com> Date: Wed Nov 5 02:06:31 2025 +0500 Removed outdated warning about batch contamination (#4423) commit 8b0a3ce Author: Albert Villanova del Moral <8515462+albertvillanova@users.noreply.github.com> Date: Tue Nov 4 21:37:39 2025 +0100 Update tokenizer apply_chat_template with return_dict=True default (#4448) commit d9f9e2b Author: Pramodith Ballapuram <16939722+pramodith@users.noreply.github.com> Date: Tue Nov 4 19:56:58 2025 +0000 Support casting to fp32 when word embeddings are tied to lm_head (#4446) commit 4e138ab Author: Sergio Paniego Blanco <sergiopaniegoblanco@gmail.com> Date: Tue Nov 4 15:15:23 2025 +0100 Upload notebook with T4 selected (#4449) commit 43253b2 Author: Pramodith Ballapuram <16939722+pramodith@users.noreply.github.com> Date: Mon Nov 3 21:07:31 2025 +0000 Add On-Policy Distillation from thinking labs to paper index. (#4410) Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com> commit 6f41b18 Author: Behrooz Azarkhalili <80390531+behroozazarkhalili@users.noreply.github.com> Date: Mon Nov 3 10:57:51 2025 -0800 fix: Remove chat template setting from non-SFT trainer scripts (#4437) Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com> Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

Pass explicitly return_dict=True to apply_chat_template

7ec322b

albertvillanova changed the title ~~Pass explicitly return_dict=True to apply_chat_template~~ Update tokenizer apply_chat_template with return_dict=True default Nov 4, 2025

qgallouedec approved these changes Nov 4, 2025

View reviewed changes

qgallouedec merged commit 8b0a3ce into huggingface:main Nov 4, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update tokenizer apply_chat_template with return_dict=True default#4448

Update tokenizer apply_chat_template with return_dict=True default#4448
qgallouedec merged 1 commit intohuggingface:mainfrom
albertvillanova:fix-4447

albertvillanova commented Nov 4, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Nov 4, 2025

Uh oh!

qgallouedec left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

albertvillanova commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Nov 4, 2025

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

albertvillanova commented Nov 4, 2025 •

edited

Loading